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hJ ! Erdos introduced the notion of abelian square in a 1961 paper [1]: 

' 

c/3 ! Definition 1. A word w is an abelian square if there exist two words s and t such that 
w = st and t is a permutation of s. 

^ I At the DLT 2011 conference in Milan, Italy, Maxime Crochemore asked if the language 

\^ • of non-abelian squares is context-free. In this note, we answer his question. 

. Let S = {0, 1} be the alphabet in this context. Let L denote the language of all words 

^ , that are not abelian squares over E. Our goal is to prove the following: 

5 i Theorem 2. L is not context-free. 

^ , In order to prove this, we introduce some notation. Define Wi = 10* for all i > 1. Let 

^ ; R = wlW3W^W3Wl 

X : Lemma 3. The word <«;3^i^2'^"w^3ti^3^"'^"^ eLn (S^)* n R. 



Proof. Suppose z = w2w3W2'~^"'W3wl^'^'~^"'\ Clearly z e (S^)* fl R. Suppose m is the total 
number of I's in z. Then m = 3n! + 4n + 2. However, the number of I's in the second half of 
z is ^ -|- 2?7, + 1, which is not equal to y. Hence z is not an abelian square. Thus z G L. □ 

Definition 4. Let w = 0''"10'^^ ■ ■ ■ 10**= be a word over the alphabet S. We define 

alt(w;) = \{1 < i < k : Si 
Define alt(-) over a language K as follows: a\t{K) = max^^jg^ alt(w). 

Definition 5. A sequence of non-negative integers {ak)^=i is called uneven if n > 1 and 
3i G [l,n] such that ^ cii+i. Here a„+i = ai. Otherwise, it is called even. 
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Definition 6. Suppose w is a word that contains a 1. Then w is called uneven if the 
sequence (si + s^+i, S2, . . . , s^) is uneven, where w = O^^W^l ■ ■ ■0^'^10^''+^. Otherwise, it is 
called even. 

Example 7. Suppose vj is an even word and of the form O^^lO^^l ■ ■ ■ O^^lO^^+i. Now we 
consider what w looks like. Since w is even, we get that Si + s^+i = S2 = ■ ■ ■ = Sk- Then 
w = On(0"+*l)'=0* for some s,t,k> 0. 

Lemma 8. If w is an uneven word, then alt(w'^) > k — 1. 

Proof. Suppose w = O'HO'n ■ ■ ■ 0'nO'^+^. Then w'' = 0'^{W^ ■ ■ ■ I0"'=10"'=+i+"i)''-il0"2 ■ ■ ■ 10^^ 
10'''=+i. Since w is uneven, we get alt(10''2 ■ ■ ■ 10'''=10'''=+i+''i) > 1. It follows that alt(«;^) > 
k-1. □ 

Lemma 9. Let T = Ln (S^)* fl R. Then T is not context-free. 

Proof. (By Ogden's Lemma) For any n > 4, let 2; = w2wsvu2'~^"'wswl^^'~^^\ By Lemma [3] we 
see that z & T. Mark the first 4n bits of z, that is, the bits corresponding to w2. Let m{s) 
denote the number of bits marked in s. Now we show by contradiction that no decomposition 
z = UoVoWoXoVo satisfies all the following three conditions: 

1. condition A: m{voXo) > 

2. condition B: m^VoWoXo) < n 

3. condition C: Vz > 0, UqvIwqXqUq G T 

Before further consideration into all possible decompositions, we first mark z with dif- 
ferent colors. Mark the bits corresponding to w1 red. Mark the next 3 bits corresponding 
to W3 blue. Mark the bits corresponding to w'^'^'^ green. Mark the bits corresponding to 
^s'W^a''"'^"^ black. Define a new function m(color, x) as the number of bits in x colored color. 
Note that m{x) in our former definition is the same as m(red, x). Here is a picture of how z 
is colored: 

WiWi ■ ■ -Wi W3 W2 - ■ ■W2 W3W3 ■ ■ - Ws 
red blue green black 

Now we list all possible cases. 

(i) Either f or x is the empty word. Without loss of generality, suppose x is empty. 

(i) V e 0+. 

(ii) V contains a 1 and v is uneven. 

(iii) V contains a 1 and v is even. 

(ii) Both V and x are non-empty words. 

(i) V eO^ or X e 0+. 
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(ii) Both V and x contain a 1; f is uneven or x is uneven. 

(iii) Both V and x contain a 1 and are even. 

(i) m(red,v) = 0, which means that w2 precedes v in z. 

(ii) m(red, v) > 0. 

(i) m(red, x) > 0. 

(ii) m(red, x) = and m(green, x) > 

• m(blue, x) = 3. 

• m(blue, x) = 2. 

• m(blue, x) = 1. 

• m(blue,a;) = and m(green,a;) > 1. 

• m(blue, x) = and m(green, x) = 1. 

(iii) m(red, x) = m(green, x) = and m(black, x) > 

Suppose there exists a decomposition z = uvwxy satisfying the above three conditions 
simultaneously. 

Case i: First we consider the case when either v or x is empty. Without loss of generality, 
suppose X is empty. Then v cannot be empty, since vx is non-empty. 

Case i.i: Suppose v = O'' for some k G N^. Then we select i = 4. Since there are more 
than 3 successive O's in v'^, this is also true for uv'^wx'^y. However, no word in T contains 
more than 3 successive O's. Hence we get a contradiction. 

Case i.ii: Suppose v contains a 1 and is uneven. We pick i = 6. Then alt(v^) > 5 > 
alt(i?) = 4 by Lemma [H So uv^wx^y ^ T, which violates condition C. 

Case i.iii: Now we consider when v is even. In this case v can be written in the form 
gfc-|^^gfc+s]^-jpgs some k,s,p G M (as we mentioned in Example [7]). Then it follows that 
m(green, v) = and A; + s = 3 by the following argument. Suppose m(green, v) > 0. Then 
m(blue,v) = 3. That is to say, the ws between w^s and u'2's lies in v. Then v must be 
of the form ri01001r2 for some words ri and r2- It follows that k + s = 2, since v is even. 
Now we select i = 2. Then uv'^wx'^y = w2w^3W2'~^"'W3wl^"''~^"'^ for some / > 1, which violates 
condition C. Now suppose + s 7^ 3. Then we pick i = 2. It follows that uv'^wx'^y is of the 
form w\w'^f^J]^^w\w'iW2'^'^W'iw'f^^'^^'' ^ T, which violates condition C again. Now let i = 

Ti f T1 j-t, J- i i n\+n n\+n 2(n!+n) / n\+n n\+ri\ / 2(n\+n)\ . 

It follows that Zi = uv wx^y = 10^^ W3W2 W3W3 = [w^^ W3W2 jlw^w^ ) is an 
abelian square, a contradiction. 

Case ii: Both v and x are non-empty. In this case, we first show that both v and x 
contain a 1. Then, we show v and x are even. Finally we rule out all subcases under the 
condition that v and x are even. 

Case ii.i: Suppose f = 0^ or x = O' for some k, I G N"*". By a similar analysis in Case i.i, 
we get that this case violates condition C. 
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Case ii.ii: Suppose v is uneven. By a similar analysis in Case i.ii, we see that this case 
violates condition C. The same applies to the case when x is uneven. 

Case ii.iii: Now it remains to consider when v and x are even. Suppose v — 0'^1(0'^"'"*1)^0* 
for some k^s,p E N, and x ~ 0'^1(0'^"'"'^1)'^0'^ for some c, d, e G N. 

Case ii.iii. i: First of all we consider the case when m(red, = 0. Then m(red,a;) = 
since x precedes v in z. It follows that m(red, vx) = 0, which violates condition A. 

Case ii.iii. ii: Now we turn to the case when m(red,?;) > 0. By the same argument in 
Case i.iii, we get that m(green, v) = and k + s = 3. Note that p < n, for otherwise the 
condition m(green, v) = cannot be satisfied. Now we consider the following subcases: 

Case ii.iii. ii.i: If m(red, x) > 0, then m(green, x) = and c + d = 3. After selecting i = 

we get that Zi = uv^wx^y = w^^ W3W2 w^w^ = {w2 W3W2 jiwsw^ ) is 
an abelian square, which violates condition C again. 

Case ii.iii. ii.ii: If m(red, a;) = and m(green, x) > 0, then m(blue,x) < 3, for otherwise 
X cannot be even. There are again four subcases here. 

1. The first subcase is m(blue, x) = 2. Then x is in the form OOlri, where ri is any word. 
We see that (a) ri = e or (b) ri = 0, for otherwise x cannot be even, (a) If ri = e, then 

X = 001. We pick ^ = 2. It follows that uv^wx^y = <+^p+^«;|«;^'+"m;3«'3^"''^"^ ^ T, 
which voilates condition C. (b) If ri = 0, then x = 0010. We pick i = 2. It follows 
that uv^wx^y — w2~^'^^~^'^W3W4W2'~^'^Wswl^"''~^"'^ ^ T, which also violates condition C. 

2. The second case is m(blue, x) = 1. Here is a picture. 

z — W4W4 • ■ ■ tt;4l0 ^0w2 ■ • y ■ ■ W2W3 ■ ■ - w^ 

X 

Thus (a) X = 010 or (b) x = (01)' for some / > 0. (a) If x = 010, then we pick i = 2. 
It follows that uv^wx^y = w2^^^^^wlw^-+''w3wf''-^''^ ^ T. (b) Otherwise x = (01)' for 

n! 



some I > 0. After picking i — we get 



n\+n "!+"+TT^ 2(n!+n) 
Zi — w2 W3W2 ^'^WsW^ 

I n!+n n!+n 2(l+p)\/ 2(l+p) 2(n!+n)\ 
= [W^ ^ W3W2 W2 ){W2 W3W2 ) 



Note that 2{i+p) integer since n > 4. Therefore Zi is an abehan square. Hence 

Zi ^ T, a contradiction. 

3. The third case is m(blue, x) — and m(green, x) > 1. Similarly x must be of the form 
(01)' or (10)' for some I e N+, since x is even. We pick i — ^ and find the same 
result as in the second case. 
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4. The last case is exactly when m(blue,x) = and m(green,x) = 1. Then x has to be 
the first or the last letter of the substring w'^'^'^ of z, since x cannot be a single or 1 
(there are no successive I's in elements of R). Moreover, we find that x cannot be the 
first letter of ^2'"*""^! since m(blue, x) = 0. It follows that x is the trailing of W2^'~^"'. 
Under this circumstance, we find that x = 0(100)'^10, since x is even. Now we select 
i = It follows that 

Zi = uv^wx'^y 

n\+n n\+n 2(«!+n)+ 

= W^^ W3W2 ^ W3W3 

(l + e)n! 

/ n\+n n\+n\ i+v I 2(n!+n)\ 

(l + e)n! (l + e)n! 

/ n!+n n!+n 2(l+p) \/ 2(l+p) 2{n!+n)\ 



Note that ^2(i+p) integer since n > 4. Thus Zi is an abelian square. Therefore 

Zi ^ T, a contradiction. 



Case ii.iii.ii.iii: The last possible subcase is when m(red, x) = and m(green, x) = and 

1+p' 



m(black, x) > 0. In this case, we get that c + d = 2. Now we pick i = j^. It follows that 



Zi = uv wx y 



„!+„ 2(n!+n)+(i±£M 

(l + e)ra! 
(l + e)n! (l + e)n! 

/ n\+n n\+n 2(i+p) \/ 2(i+p) 2(n!+n)\ 



Note that ^2(i+p) integer since n > 4. Thus Zi is an abelian square. Therefore Zi ^ T, 

a contradiction again. 

With the above discussion, we claim that no decomposition of z can satisfy all three 
conditions simultaneously. Thus T is not context-free. □ 

Proof. Combining Lemma [9] with the closure property, that the language obtained by inter- 
secting a context-free language and a regular language is still context-free, we finally obtain 
this conclusion. □ 
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